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ABSTRACT 


In  this  project  we  are  going  to  compare  the  performance  of  different  adders  implemented  to  the  multipliers 
based  on  area  and  time  needed  for  calculation.  The  CLAA  based  multiplier  uses  the  delay  time  of  99ns  for 
performing  multiplication  operation  where  as  in  CSLA  based  multiplier  also  uses  nearly  the  same  delay  time 
for  multiplication  operation.  But  the  area  needed  for  CLAA  multiplier  is  reduced  to  31  % by  the  CSLA  based 
multiplier  to  complete  the  multiplication  operation. 
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I.  Introduction 

To  humans,  decimal  numbers  are  easy  to 
comprehend  and  implement  for  performing 
arithmetic.  However,  in  digital  systems,  such  as  a 
microprocessor,  DSP  (Digital  Signal  Processor)  or 
ASIC  (Application -Specific  Integrated  Circuit), 
binary  numbers  are  more  pragmatic  for  a given 
computation.  This  occurs  because  binary 
values  are  optimally  efficient  at  representing  many 
values. 

Binary  adders  are  one  of  the  most  essential  logic 
elements  within  a digital  system.  In  addition, 
binary  adders  are  also  helpful  in  units  other  than 
Arithmetic  Logic  Units  (ALU),  such  as  multipliers, 
dividers  and  memory  addressing.  Therefore,  binary 
addition  is  essential  that  any  improvement  in 
binary  addition  can  result  in  a performance  boost 
for  any  computing  system  and,  hence,  help 
improve  the  performance  of  the  entire  system. 
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Figure  1 Binary  adder  example 


The  binary  adder  is  the  critical  element  in  most 
digital  circuit  designs  including  digital  signal 
processors  (DSP)  and  microprocessor  data  path 
units.  As  such,  extensive  research  continues  to  be 
focused  on  improving  the  power  delay  performance 
of  the  adder.  In  VLSI  implementations, 
parallel-prefix  adders  are  known  to  have  the  best 
performance.  Reconfigurable  logic  such  as  Field 
Programmable  Gate  Arrays  (FPGAs)  has  been 
gaining  in  popularity  in  recent  years  because  it 
offers  improved  performance  in  terms  of  speed  and 
power  over  DSP-based  and  microprocessor-based 
solutions  for  many  practical  designs  involving 
mobile  DSP  and  telecommunications  applications 
and  a significant  reduction  in  development  time 
and  cost  over  Application  Specific  Integrated 
Circuit  (ASIC)  designs. 

Ripple-Carry  Adders  (RCA): 

The  simplest  way  of  doing  binary  addition  is  to 
connect  the  carry-out  from  the  previous  bit  to  the 
next  bit's  carry-in.  Each  bit  takes  carry-in  as  one  of 
the  inputs  and  outputs  sum  and  carry-out  bit  and 
hence  the  name  ripple -carry  adder.  This  type  of 
adders  is  built  by  cascading  1-bit  full  adders.  A 
4-bit  ripple-carry  adder  is  shown  in  Figure  2.3. 
Each  trapezoidal  symbol  represents  a single -bit  full 
adder.  At  the  top  of  the  figure,  the  carry  is  rippled 
through  the  adder  from  CO  to  C4. 
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Figure  2 Ripple- Carry  Adders. 


Carry-Look-ahead  Adders  (CLA): 

The  carry-chain  can  also  be  accelerated  with 
carry  generate/propagate  logic.  Carry-look  ahead 
adders  employ  the  carry  generate /propagate  in 
groups  to  generate  carry  for  the  next  block.  In  other 
words,  digital  logic  is  used  to  calculate  all  the 
carries  at  once.  When  building  a CLA,  a reduced 
version  of  full  adder,  which  is  called  a reduced  full 
adder  (RFA)  is  utilized.  Figure  2.4  shows  the  block 
diagram  for  an  RFA.  The  carry  generate/propagate 
signals  gi/pi  feed  to  carry-look  ahead  generator 
(CLG)  for  carry  inputs  to  RFA. 
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II.  Carry  Select  Adder  (CSLA) 

The  concept  of  CSLA  is  to  compute  alternative 
results  in  parallel  and  subsequently  selecting  the 
correct  result  with  single  or  multiple  stage 
hierarchical  techniques.  In  CSLA  both  sum  and 
carry  bits  are  calculated  for  two  alternatives  Cin=0 
and  1.  Once  Cin  is  delivered,  the  correct 
computation  is  chosen  using  a mux  to  produce  the 
desired  output.  Instead  of  waiting  for  Cin  to 
calculate  the  sum,  the  sum  is  correctly  output  as 
soon  as  Cin  gets  there.  The  time  taken  to  compute 
the  sum  is  then  avoided  which  results  in  good 
improvement  in  speed. 


A3f  B3  A1/B1 


Figure  4Carry  Select  Adder. 


In  general,  we  can  write  the  algorithm  as: 

If  Carry  in  = 1 , then  the  sum  and  carry  out  are  given 

by, 

Sum  (i)  =a  (i)  xor  b (i)  xor  T.  (11) 

Carry  (i+  1)  = (a  (i)  and  b (i))  or  (b  (i)  or  a (i)).  (12) 
If  Carry  in  =0,  then  the  sum  and  carry  out  are  given 
by, 


Figure  3 Carry  Look  Ahead  Adder. 

Let  Gi  is  the  carry  generate  function  and  Pi  be  the 
carry  propagate  function,  Then  we  can  rewrite  the 
carry  function  as  follows: 

Gi  = Ai-  Bi.  (1) 

Pi  = (Aixor  Bi).  (2) 

Si  = Pi  xor  Ci.  (3) 

Ci+1=  Gi  + Pi.Ci.  (4) 

Thus,  for  4-bit  adder,  we  can  compute  the  carry  for 
all  the  stages  as  shown  below: 

Cl  = Go  + Po.Co.  (5) 

C2  = GI  + PI. Cl  = GI  + PI. GO  + PI. PO.CO  (6) 
C3  = G2  + P2.C2=  G2  + P2.G1  + P2.P1  .GO  + 
P2.P1.PO. CO  (7) 

C4  = G3  + P3.C3  = G3  + P3.G2  + P3.P2  .GI  + 
P3.P2.P1.GO  + P3.P2.P1.PO. CO  (8) 

In  general,  we  can  write: 

The  sum  function: 

SUM  i=A  i xor  B i xor  C i=P  i xor  C i 


Sum  (i)  = a (i)  xor  b (i) . 

(13) 

Carry  (i+  1)  = (a  (i)  and  b (i)). 

(14) 

The  sum  function: 

Si=CiSi°+  Ci  Si1. 

(15) 

The  carry  function: 

Ci+i=Ci  Ci+i°  + Ci  Chi1. 

(16) 

MULTIPLIER: 

Multiplication  involves  the  generation  of  partial 
products,  one  for  each  digit  in  the  multiplier,  as  in 
Figure  .These  partial  products  are  then  summed  to 
produce  the  final  product.  The  multiplication  of 
two  n-bit  binary  integers  results  in  a product  of  up 
to  2n  bits  length. 


The  carry  function: 

Ci+1=  Gi  + Pi.Ci. 


On  comparison  with  the  carry  look-ahead  adder 
(CLAA)  based  multiplier  the  area  of  calculation  of 
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the  different  carry  select  adder  (CSLA)  based 
multiplier  is  smaller  and  better  with  nearly  same 
delay  time.  Here  we  are  dealing  with  the 
comparison  in  the  bit  range  of  n*n  (32*32)  as  input 
and  2n  (64)  bit  output. 


Figure  5 Multiplier. 


III.  Simulation  Results 

The  simulation  results  of  CLAA  and  CSLA  based 
multipliers  are  as  shown  in  below  figures 
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Figure  7 Waveform  for  a CSLA  based  multiplier.  [3] 
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Figure8  Timing  Analysis  for  CLAA  based  multiplier. 
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Figure  9 Timing  Analysis  for  CSLA  based  Multiplier 


IV.  Analysis  Table 

In  this  analysis  table  shown  in  figurel4,  the  delay 
time  is  nearly  same,  the  area  and  the  area  delay 
product  of  CSLA  based  multiplier  is  reduced  to  31 
% when  compared  to  CLAA  based  multiplier. 


Multiplier 

type 

Delay(ns) 

Area 

Delay  Area 
Product 

CLAA  Based 
Multiplier 

98.5 

2957  Logic 
Cells 

2912645 

CSLA  Based 

Multiplier 

99.5 

2039  Logic 
Cells 

2028805 

V.  Conclusion  and  Future  Work 

A design  and  implementation  of  a VHDL-based 
32bit  unsigned  multiplier  with  CLAA  and  CSLA 
was  presented.  VHDL,  a Very  High  Speed 
Integrated  Circuit  Hardware  Description  Language, 
was  used  to  model  and  simulate  our  multiplier. 
Using  CSLA  improves  the  overall  performance  of 
the  multiplier.  Thus  a 31  % area  delay  product 
reduction  is  possible  with  the  use  of  the  CSLA 
based  32  bit  unsigned  parallel  multiplier  than 
CLAA  based  32  bit  unsigned  parallel  multiplier. 

This  32  bit  multiplier  can  be  further  extended  to 
64  bit  multiplier  and  128  bit  multiplier  using  the 
proposed  method  for  multiplication  operation  can 
be  done  as  future  work. 
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